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Abstract 

We study the complexity of computing equilibria in two classes of network games based on flows 
- fractional BGP (Border Gateway Protocol) games and fractional BBC (Bounded Budget Connection) 
games. BGP is the glue that holds the Internet together and hence its stability, i.e. the equilibria of 
fractional BGP games [15], is a matter of practical importance. BBC games |22] follow in the tradition 
of the large body of work on network formation games and capture a variety of applications ranging 
from social networks and overlay networks to peer-to-peer networks. 

The central result of this paper is that there are no fully polynomial-time approximation schemes 
(unless PPAD is in FP) for computing equilibria in both fractional BGP games and fractional BBC 
games. We obtain this result by proving the hardness for a new and surprisingly simple game, the 
preference game, which is reducible to both fractional BGP and BBC games. 

We define a new flow -based notion of equilibrium for matrix games - personalized equilibria - which 
generalizes both fractional BBC games and fractional BGP games. We prove not just the existence, but 
the existence of rational personalized equilibria for all matrix games, which implies the existence of 
rational equilibria for fractional BGP and BBC games. In particular, this provides an alternative proof 
and strengthening of the main result in |15 |. For fc-player matrix games, where fc = 2, we provide a 
combinatorial characterization leading to a polynomial-time algorithm for computing all personalized 
equilibria. For fc > 5, we prove that personalized equilibria are PPAD-hard to approximate in fully 
polynomial time. We believe that the concept of personalized equilibria has potential for real-world 
significance. 



1 Introduction 



This paper concerns two classes of games on networks involving fractional flows — fractional BGP 
games and fractional BBC games. These games model important practical systems such as the Internet and 
social networks. The stable operating points of these systems have real-world significance and hence there 
is interest in their pure Nash equilibria. In order to understand the structure and computational complexity 
of these equilibria, we define two new concepts — personalized equilibria for matrix games and preference 
games — which are of independent interest. Below we briefly describe and motivate each of the four 
different kinds of games. 

Fractional BGP games. The Border Gateway Protocol is the core routing protocol of the Internet. 
BGP can be viewed as a distributed mechanism for solving the stable paths problem [13]. In this paper, we 
refer to the fractional version of the stable paths problem introduced in |[T5l as the fractional BGP game. 
Intuitively, the fractional BGP game is a game played between Autonomous Systems that assign fractional 
capacities to the different paths leading to the destination in such a way that they maximize their utility 
without violating the capacity constraints of downstream nodes. Clearly, the equilibria of this game have 
significant implications for the stability of the Internet. 

Fractional BBC games. Consider a social network where people have to spend time and (cognitive) 
resources to build connections to people. This situation naturally lends itself to being modeled by the 
Bounded Budget Connection game ll22l l23l . In a BBC game, strategic nodes acting under a cost budget 
form connections with a view to optimizing their proximity to nodes of interest, which in turn depends on 
other nodes' strategic actions. BBC games belong to the much studied class of network formation games. 
These games have applications to a variety of problems ranging from "how to monetize a social network" to 
"how to structure incentives in a peer-to-peer network to reduce congestion." Fractional BBC games were 
defined in ll23]i . which left unresolved the complexity of finding their Nash equilibria. 

Personalized equilibria for matrix games - a generalization. Imagine a business selling outfits con- 
sisting of a pant (solid or striped) and a shirt (cotton or wool). The manager of one location decides on the 
ratio of striped to solid pants while the manager at the other decides on the ratio of cotton to wool shirts. 
Each manager is given the same number of shirts and pants (in the proportions decided) and has to assemble 
and sell the outfits at her own location in such a way as to maximize her individual profits. Personalized 
equilibria for matrix games capture exactly this situation: each player chooses a distribution over her own 
actions, but then each player independently customizes the matching of her own actions to the actions of 
other players in such a way as to maximize individual payoff. The concept of personalized equilibria for 
matrix games generalizes both fractional BGP and fractional BBC games. 

Preference games - a specialization. It is New Year's Eve. You and each of your friends is hosting a 
party. Each of you has a preference order over the others' parties and has to determine the fraction of the 
evening that you will spend at each party. Naturally, one cannot spend more time at a party than the person 
hosting that particular party. Your optimal action - how long to host your party and which other parties to 
attend for how long - depends on your preference and other players' actions. Such preference games arise 
whenever each player has a preference among her actions and her distribution over her actions is somewhat 
constrained by others' distributions. Preference games are reducible in polynomial-time to both fractional 
BGP and BBC games. 

1.1 Our Contributions 

Our paper centers on the study of two classes of flow-based network games — fractional BGP games 
and fractional BBC games, formally defined in Section[2] We address the following two questions: 
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1 . Does a Nash equilibrium always exist and if so what does the set of equilibria look like? 

2. From a computational standpoint how difficult is it to find a Nash equilibrium? 

To answer the first question, we define a new flow-based notion of equilibrium for matrix games - per- 
sonalized equilibria. Personalized equilibria for matrix games constitute a novel and useful generalization 
of the concept of Nash equilibria for both fractional BGP as well as fractional BBC games. By employing 
this generalization, we are able to show the following results for multi-player matrix games. 

• We show that the set of personalized equilibria for any multi-player matrix game is always nonempty 
and contains a rational point, though it may be nonconvex (Section |4|. 

It follows that a rational equilibrium always exists for both fractional BGP and BBC games. We provide an 
alternate characterization for BGP games that enables a simpler existence proof and strengthens the main 
result in lfT5ll . We expect that personalized equilibria will be applicable elsewhere, since they capture real- 
world situations in which players have the opportunity to customize their use of others' actions. 

To answer the second question, we create a new combinatorial A; -player abstract game - the preference 
game (see Section [3]). Preference games are extremely elementary games that are a simultaneous simplifi- 
cation of both fractional BGP and fractional BBC games. By employing this simplified abstraction, we are 
able to obtain the following by reduction from a Brouwer fixed point problem ||5l|6l|7]|. 

• There are no fully polynomial-time approximation schemes (unless PPAD is in FP) for computing 
equilibria in preference games (Section [3]l. 

It follows that there are no fully polynomial-time approximation schemes (unless PPAD is in FP) for com- 
puting equilibria in fractional BGP and fractional BBC games, as well as personahzed equihbria in multi- 
player matrix games. Our result for fractional BGP games settles a question left open in [15], while our 
result for fractional BBC games settles an open question from 1221 . Finally, we study the complexity of 
personalized equilbria in fc-player matrix games, for fixed k. 

• For k = 2, we provide a combinatorial characterization which implies a polynomial- time algorithm 
for computing the personalized equilibrium (Section |4]l. 

• For A; > 4, it is PPAD-hard to find personalized equilibria. Furthermore, for A; > 5, there is no fully 
polynomial time approximation scheme (unless PPAD is in FP) for finding personalized equilibria 
(Section |4]). 

1.2 Related Work 

Nash equilibrium f2^. "26^ is arguably the most influential solution concept in game theory. Decades 
after Nash, Papadimitriou defined a complexity class PPAD [29.1 to characterize proofs that rely on parity 
arguments. Recently, an exciting breakthrough made in Q and strengthened in showed the hardness 
of approximating Nash equilibria. Since then, there has been a flurry of work on the complexity of finding 
equilibria in a variety of games and markets ETl . All of our hardnesss proofs build on the framework 
established by fSl l6l l7l| and heavily use their techniques. 

BGP has been the focus of much attention since its inception lOTllBH . The integral stable paths problem 
was introduced llT3l to explain the nonconvergence of BGP ||33]| . The fractional relaxation of the stable paths 
problem, or the fractional BGP game, was defined in fTSl where they proved the existence of an equilibrium 
but left open the complexity of finding it. [20| gives a distributed algorithm for finding an e-approximation 
for the fractional BGP game that is guaranteed to converge, although no bounds are given on the time-to- 
convergence (the main result in this paper implies a polynomial upper bound is unlikely). Other related 
works include a multicommodity version Il30ll24l and mechanism design lITll . 
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The BBC game, introduced in ll22l |23]| . builds on a large body of work in network formation games 
ifTTl l4l. A direct precursor to BBC games was introduced in lITOl . ifTOl . which together with subsequent 
works fT, 'S'l, focuses on obtaining price of anarchy results f2\\. In f22\, it is shown that it is NP-hard 
|[T2l to determine whether an equilibrium exists in integral BBC games. Fractional BBC games were also 
introduced in [23], but the problem of finding an equilibrium was left open. Other related works include a 
stochastic small-world version Q and use of contracts ||2l[T8]|. 

2 Definitions 

In this section, we define fractional BGP and fractional BBC games. Our definitions lead to the existence 
of Nash equilibria for these games using standard fixed-point techniques. We defer the formal proofs of 
existence of equilibria, however, to Section |4j where we establish the existence of a more general class of 
equilibria that includes the equilibria for both fractional BGP and BBC games. 

2.1 Fractional BGP 

The fractional BGP game is based on a new model of [15], introducing the notion of a fractional stable 
paths solution in the context of BGP. We first present the model of [ 15], and then define the fractional BGP 
game, whose Nash equilibria are equivalent to fractional stable paths solutions. 

Let G be a graph with a distinguished node d, called the destination. Each node v ^ d has a list ■k{v) 
of simple paths from v to d and a preference relatioi|^ >t, among the paths in t:{v). For paths P and P' in 
P >v P' indicates that v prefers P at least as much as P'. We say that P P' if P >y P' is true 
but P' >i; P is not true. When it is clear from context that we are talking about the preferences for node 
V, we will write P > P' instead of P >y P'. For a path S, we also define it{v, S) to be the set of paths in 
7r{v) that have 5 as a suffix. A proper suffix of P is a suffix of P such that S ^ P and 5" / 0. 

A feasible fractional paths solution is a set u; = {w^ : v ^ d} of assignments : 'k{v) — > [0, 1] 
satisfying the following: 

1. Unity condition: for each node v, '}2pi^-k{v) '^vi^) ^ 1 

2. Tree condition: for each node v, and each path S with start node u, Ylpeniv s) "^viP) < WuiS). 

In other words, a feasible solution is one in which each node chooses at most 1 unit of How to d such that no 
suffix is filled by more than the amount of flow placed on that suffix by its starting node. A feasible solution 
w is stable if for any node v and path Q starting at v, one of the following holds: 

(51) J2peTT{v) 'Wv{P) = 1, and for each P in 7r{v) with w^{P) > 0, P >^, Q; or 

(52) There exists a proper suffix 5 of Q such that J^Penl^v s) '^v{P) = Wu{S), where u is the start node 
of S, and for each P £ 7r{v, S) with Wy{P) > 0, P >v Q- 

In other words, in a stable solution: if node v has not fully chosen paths that it prefers at least as much as Q, 
then it has completely filled path Q by filling some suffix with paths it prefers at least as much as Q. 

We now define the fractional BGP game. For convenience, let w-^ denote {wu '■ u / d,v}. Given 
assignments w^, w'^, and W-y such that {wv^W-^) and {w'^^W-^) are both feasible, we say is lexico- 
graphically at least w'^ (implied: with respect to w^y) if the following holds for every path P in 7r{v): 
X]p'>p Wy{P') > X]p/>p w'y{P'). We say that Wy is lexicographically maximal (implied: with respect to 
w-y) if {wy,w-y) Is fcaslblc and Wy is lexicographically at least every assignment w'y such that {w'y,w-y) 
is feasible. 

'a preference relation is a binary relation that is transitive and complete. 
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In the fractional BGP Game, a strategy for a node 7^ (i is a weight function : tt{v) — > [0, 1] that 
satisfies the unity and tree conditions, and the preference relation among the strategies of a node v is defined 
by the lexicographically at least relation. (Thus, a node's best response is a lexicographically maximal flow.) 

We can now show that fractional stable paths solutions are equivalent to (pure) Nash equilibria in the 
fractional BGP game. We note that 1201 has also independently shown that a fractional stable paths solution 
is a Nash equilibrium of a suitably defined game. 

Theorem 1. A fractional paths solution is stable iff it is lexicographically maximal for every node. 

Proof. A stable paths solution is a lexicographically maximal flow. Let w be a fractional stable paths 
solution. Assume, for the sake of contradiction, that Wy is not lexicographically maximal with respect 
to W-y. Then there exists an assignment w'^ such that {w'^,w-v) is feasible and w'^ is lexicographically 
greater than with respect to W-^. Among all such assignments, we set to be an assignment such that 
the preference of the highest preference path at which and w* differ is smallest. Let P be the highest 
preference path at which they differ and let V denote the set of all paths with the same preference as P. 

By the definition of stability, at least one of the two stability conditions must hold for P mw^. First, 
assume (SI) is satisfied. Then we have X^p/g7r(i;) 'Wv{P') = 1 and each P' G ^{v) with Wv{P') > is 
such that P' > P. This implies that J2p'>p''^v{P') = 1- But since w* satisfies the unity condition, we 
have J2p'>p '^v{P') — 1- However, this means that Wv is lexicographically at least w* (by definition of 
"lexicographically at least"), so tf* is not lexicographically greater than w^, a contradiction. 

If (SI) does not hold for P, then condition (S2) must be satisfied for each path in V. For each Q ^ V, 
there exists a proper suffix Sq (say with start node u) of Q such that X]p'e7r(D Sq) '^v{P') = Wu{Sq), and 
each P' G t:{v, Sq) with Wv{P') > is such that P' >y P; for each Q, we set Sq to be the smallest 
such suffix. By our choice of Sq for each Q, we obtain that for Q,Q' G V, 7r{v, Sq) and tt{v,Sq) 
are disjoint if Q 7^ Q'. Since {wl,W-y) satisfies the tree condition, we have J^p'eniv Sq)'^v(.P') ^ 
WuiSQ). Therefore, using the fact that 7r(f,5Q)'s are all disjoint, J2QevT^P'e7r{v,SQ):P'>P "^K^') - 
Y^QeV ^"('S'q) = Z^QeP ^P'<=TT{v,SQy.P'>P Wv{P')- Furthermore, since is identical to on all paths 
more preferred than the paths in V, we obtain that XIqgP "^viQ) ^ J^Qev Wv{Q). 

We now consider two cases. If J^QeV "^viQ) < X^QeT' ''^viQ), then is lexicographically greater 
than w*, leading to a contradiction. Otherwise, we derive a new assignment w'^ that is identical to w^, 
except on paths in V, where it is identical to w^. This new assignment w'^ is lexicographically greater than 
Wv, since was lexicographically greater; the highest preference path at which it differs from Wv, however, 
has lower preference than that for w*, contradicting our choice of w*. 

A lexicographically maximal flow is a stable paths solution. Let Wy be a lexicographically maximal flow 
with respect to W-^. Consider any path Q that starts at a node v. Suppose, for the sake of contradiction, Q 
does not satisfy either of the two stability conditions. That is, we have (i) X]p>Q Wy{P) < 1, and (ii) for 
each proper suffix 5 of Q with start node u, we have J^Pewiv s) P>Q '^v{P) < Wu{S). We derive a new 
assignment w'y which is identical to Wv except for the following: w'y{Q) = Wv{Q) + e, for a suitably small 
e > 0; for each proper suffix 5 of Q, if there exists a path that is less preferred than Q, shares S, and has 
positive weight, then we select one such path P and set w'y{P) = Wy{P) — e. It is easy to see that w'^ satisfies 
the unity and tree conditions. However, w'^ is lexicographically greater than Wy, a contradiction. □ 

2.2 Fractional BBC 

We define a fractional variant of the Bounded Budget Connection game, as in f23\. A fractional Bounded 
Budget Connection game (henceforth, a fractional BBC game) is specified by a tuple {V,d,c,b), and a 
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length function £u for each u G V, where V is a set of nodes, d G V is a distinguished destination node, 
c : V X V ^ Tj, b : V ^ Ij, and £u : V x V ^ Z (for each u ^ V) are functions. For any u,v V, 
c{u, v) denotes the cost to u of directly linking to v, and ix{u, v) denotes the length of the link {u, v) from 
the perspective of x, if u has established this link. For any node u £ V, b{u), specifies the budget u has 
for establishing outgoing directed links: the sum of the costs of the links established by u times the amount 
placed on each link should not exceed b{u). 

A strategy for node n is a weight function Wu '■ V ^ [0, 1] that u places on each outgoing edge 
{u,v) : V e V such that Y^^^ c{u, v) x Wu{v) < b{u). Let Wu denote a strategy chosen by node u and let 
W = {wu '■ u G V} denote the collection of strategies. The network formed by W is simply the directed, 
capacitated complete graph G{W), in which the capacity of the directed edge (n, v) is Wu{v). The utility of 
a node u is given by —f{u), where f{u) is the cost of a 1-unit minimum cost flow from u to d, according 
to the capacities given by W and the lengths from the perspective of u given by lu- We assume that there is 
also always an additional edge from each node to d with cost 0, capacity cxd, and length = some large integer 
M S> nm.aY.x^u,v £x{u, v); we refer to M as the disconnection penalty. In other words, if the max flow from 
u to V is a < 1, then /(n) is the cost of the minimum cost a flow from uto d plus (1 — q) • M. 

3 Hardness of Finding Equilibria 

In this section, we define a very simple game, the preference game, which is a special case of both frac- 
tional BGP and fractional BBC games. In Section [3]2j we show that the set of all equilibria in a preference 
game is not convex, implying that we cannot hope to find an equilibrium for fractional BGP or BBC games 
using convex programming. We next present, in Section |3.3[ our main result: it is PPAD-hard to find an 



equilibrium in the preference game. Finally, in Section 3.4 we define an e-approximate equilibrium for 
the preference game, which encompasses two previously-defined notions of approximation for fractional 
BGP. We extend our PPAD-hardness result to approximate equilibria, thereby proving that there are no fully 
polynomial-time approximation schemes (unless PPAD is in FP) for computing equilibria in both fractional 
BGP games and fractional BBC games. 

3.1 Preference Games 

We begin by defining preference games. In a preference game with a set S of players, each player's 
strategy set is S. Each player i ^ S has a preference relation >j among the strategies. Each player i 
chooses a weight distribution, which is an assignment wi : S ^ [0, 1] satisfying two conditions: (a) the 
weights add up to 1: Ylij^s'^'^^j) ~ ^' ^'^'^ ^^'^ weight placed by i on j is no more than the weight 
placed by j on j: Wi{j) < Wj{j) for all i, j e 5. As in the case of fractional BGP, the preference relations 
>i induce a preference relation among the weight distributions as follows: Wi is lexicographically at least 
w[ if for all j G S, Ylk>ij ^«(^) — "^^ky^j ^i(^)- equilibrium in a preference game is an assignment 
w = {wi : i £ S} such that Wi is lexicographically maximal for all i G S. 

We now show that the preference game is a special case of both fractional BGP and fractional BBC. 

Lemma 1. There is a polynomial-time reduction from the preference game to the fractional BGP game. 

Proof. Consider any instance P of the preference game, consisting of a set of players S and a preference 
relation >j for each i £ S. We will create an instance B of fractional BGP. For each player i G 5, create a 
node i in B. Also create a universal destination node d. For all i ^ d, define Pa = the path (i, d). For all 
i,j / d, define Pij = the path d). For all i: define 7r(i) (the set of i's preferred paths in B) as the set 
{Pij ■ j >i i}- If k >i j in P, then Pit >i Pij in B. 

Consider any feasible solution w = {wi] for P, and define weights w' = {w'j} for B where Vi,j, 
w[{Pij) = Wi{j), and w[{P) = for all other paths P. Because w is feasible, for any i £ S, Y^ji^s = 
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1. Therefore, for all i, J^paths p ^U-^) = Y.p,y.jes '^'ii^ij) = ^jes "^iU) = 1' ^^'^ ^^^^ condition for 
B is satisfied. Also, for all i, j G S, Wi{j) < Wj{j). Therefore, for all paths i and all paths P starting at j, 
^Pije7r(j P) '^'ii^ij) — '^[{Pij) — '^iU) ^ '^jU) = '^ji^jj)' ^^e tree condition for B is satisfied, and 
w' is feasible. 

Consider an equilibrium w' = {w'^} of B, and define weights w = {wi} for P where Vz, j, Wi{j) = 
w[{Pij). Since this is an equihbrium of B, it must be feasible and lexicographically maximal. Because it is 
feasible, for each node i, "^^jP- ■e-K{i) — ^^^^ implies that in our new solution for P, Yl,j j> i ^ 

1. Because it is lexicographically maximal. Pa >i Pij =^ w^{Pij) = 0, so Y^jas ^»(-?) = '^j■.j>^i ^«(-?) - 
1. Futhermore, since Pa >i the empty path, w[{Pii) = 1 - J2p,j&TT(i),jyti '^iiPij)^ and J2jes "^^O') = 1' 
required in the preference game, w' is feasible also implies for each node i, and each path P with start node j, 
we have Ep,,e7r(i,P) '^ii^ij) < w'j{P). However, {P G 7r(j)} n {P : 7r(i, P) / 0} = {P„}, by definition 
of the preference sets. So ^^Zp^^e-Kii P)^'i^^ii) ~ ^ unless P = Pjj, and {P G Ti{i,Pjj)} = {Pij}, so 
if = Pjj then Epi^e7r(i,p,-,) w'i{Pij) = w[{Pij) < w'j{Pjj). Therefore, Wi{j) < Wj{j), as required for 
feasibility in P. 

Now, consider any other feasible assignment w = {wi} U {wj : j i} for P. Define w[{Pij = Wi{j). 
Then w' = {w[} U {wj j i} is feasible for B, as shown above, and lexicographic maximality of w' 

says that for every path Pij in 7r(i), Ep^^>.p.^. w^{Pik) > J2p^^.>iPij w[{Pik)- Therefore, for every j G S, 
^k>ij ^i(^) ^ Yk>ij so ^ is aiso lexicographically maximal, and w is an equilibrium for P. 

Finally, consider an equilibrium w = {wi} for P and the weights w' = {w[} for B as defined above. 
From above, w' is feasible. Consider any other feasible assignment w' = {w'j) U {wj : j i} for B. Since w 
is an equilibrium, it is lexicographically maximal, so for w = {wi}U{wj : j ^ i} (where Wi{j) = w[{Pij)), 

G S, J2k>ijMk) > T.k>ijWi{k). Therefore, Vz,j G S, Ep,,>.p.. <(Pife) > T.Pi^>iPi^mPik), 
and w' is also lexicographically maximal and an equilibrium for B. □ 

Lemma 2. There is a polynomial-time reduction from the preference game to the fractional BBC game. 

Proof. We use a similar reduction from a preference game to fractional BBC. Given any instance P of the 
preference game We will create an instance B of fractional BBC = (y,(i, c, 6), where V = S, d = an 
additional node, Vf, j G V: c{i,j) = 1, Vi: b{i) = 1, plus length function Zj for each i e V, defined as 
follows. Let Pi(k) = the number of j such that j > j k. Vj ^ i, li{j, d) = 1, = Pi{j). ^ i,k y^ 

i, li{j, k) = li{k,j) = \S\ + 1. li{i,d) = 1 + Pi{i). Given a solution to B, define a solution to P: set Wi{j) 
= the weight placed on edge {i, j) (for j / i), and uii{i) = the weight placed on edge {i, d). 

Consider any instance P of the preference game, consisting of a set of players S and a preference 
relation >j for each i e S. We will create an instance B of fractional BBC = {V, d, c, b), where V = S, 
d = an additional node, Vz,j G V: c{i,j) = 1, Vi: b{i) = 1, plus length function li for each i ^ V, 
defined as follows. Let Pi{k) = the number of j such that j >j k. / i, d) = 1, k{i,j) = Pi{j). 
Vj y^ i,k y^ i, li{j, k) = li{k, j) = l^l + 1. li{i, d) = 1 +pi{i). Given a solution to B, define a solution to P 
by setting Wi{j) = the weight placed on edge (i, j) (for j ^ i), and Wi{i) = the weight placed on edge {i,d). 

Since the total cost for all edges is 1, and the total budget for a node is 1, each node in B will place total 
weight 1 on edges adjacent to it. This exactly corresponds to the requirement that = 1 in P. The 

possible paths for a one-unit flow from z to d in B are: (1) the path consisting of only edge (i, d), which 
has cost pi{i) + 1 < l^l + 1, (2) a path of the form (z, d) through some other node j, which has cost 
PiU) + 1 ^ 15*1 + 1, or (3) a path including some edge (j, k) for j ^ i,k ^ i, which has cost > \S\ + 1. 
Therefore, a minimum cost flow will only use paths of the form (i, d) and (i, j, d), so the requirement in P 
that Wi{j) < Wj{j) corresponds to using the weight j places on edge {j, d) as a capacity on that edge when 
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finding the min-cost flow. Now, we only need to show that a node's best response in B exactly corresponds 
to a lexicographically maximal weight assignment in P. 

Suppose we have a best response for node i in B that corresponds to a weight assignment w in P that 
is not lexicographically maximal for i. Then, there is some assignment w' = w'^U {wj j i} such 
that for some j G S, J2k>ij < Sfc>ii ^i(^)- There must be some £ S such that >i j and 

w[{k~^) > Wi{k^), and there must be some A;^ G such that >j j) and w'^{k~) < Wi{k~). Suppose 
we move e weight in the best response in B from Pj^.- to Pik+. Pi{k^) > pi{k^), so moving this weight 
will decrease the cost of a minimum cost flow, contradicting the fact that this was a best response. 

Suppose we have a lexicographically maximal weight assignment w for P that does not correspond to a 
best response for node i in B. Then, in B, i could move weight from some path Pij to a different path Pj/;. to 
decrease the cost of its min-cost flow. This means that Pi{k) < Pi{j), or the number of nodes preferred by 
i over k is smaller than the number of nodes preferred by i over j. Since preference relations are transitive, 
this implies that k >j j. However, since Pik had space left, Wi{k) < Wk{k), so w is not lexicographically 
maximal. □ 

3.2 Non-Convexity 

Theorem 2, There exists an instance of the preference game for which the set of equilibria is not convex. 



J^. J^. 

©r:f© ir-^® ®r-© ©r:f© ®_j© "~ ^B—j© sxr:© ' 



(a) The a players assign weights (b) The a players assign weights 1/2 — (c) Combining half of each equilibrium, 
1/2, 1/2, the b players both use fei, the c 1/2, the b players both use 62, the c play- x will assign 1/2 to ai, 1/4 to each of 
players both use C2. ers both use ci. 61 and ci. x could improve by assigning 

weight only to ai and 61 . 

Figure 1 : Example of an instance of the preference game for which the equilibrium set is not convex. 

Proof. Consider the following instance of the preference game. We have 3 sets of 2 players each, oi, 02, 
61, 62. ci,C2, and one additional player, x. The preference lists for these nodes ai^e: oi: (02, ai); 02: 
(01,02); 61: (62,61); 62: (61,62); ci: (c2,ci);c2: (ci,C2);x: (oi, 61, ci, x). (Each Ust gives strategies in 
order from most preferred to least preferred.) We now show two equilibria whose linear combination is not 



an equilibrium. In equilibrium w (figure 1(a) 1: Wa^iai) = 5, ii^ai(a2) = 5, 1^02 (0^2) = 5, '^^'02(01) = 5' 
u^6i) = 1, ^62(61) = 1, Wci{c2) = 1, w^C2) = 1, w^{ai) = I, Wx{bi) = \. In equilibrium w' (figure 



l(b)i: ^^(ai) = \, <j(a2) = \, w'^^{ai) = \, w'^^{ax) = \, ^1(^2) = 1, ^2(^2) = 1, w'^^{cx) = 1, 



w'^^{c\) = 1, w'^{a\) = \, w'^icx) = \. It is easy to verify that w and w' are both equilibria, and in a 



solution A • w + (1 — A) • If' (for any A > 4) (figure 1(c) shows A = i), player x would do better by moving 



more weight to its second preference. Therefore, the convex combination of w and w' is not an equilibrium. 

□ 

3,3 PPAD Hardness 

We show that finding an equilibrium in preference games is PPAD-hard. By our reductions of Lemmas[T] 
and |2j this immediately implies that finding equilibria in fractional BOP and fractional BBC games is also 
PPAD-hard. We will follow the framework of [7], which shows that finding a Nash equilibrium in a degree- 
3 graphical game is PPAD-hard, using a reduction from the PPAD-complete problem 3-D BROUWER. In 
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this problem, we are given a 3-D cube in which each dimension is broken down into 2~" segments - thereby 
dividing the cube into 2^" cubelets. We are also given a circuit that takes as input the 3 coordinates of the 
center of a cubelet (each as an n-bit number) and returns a 2-bit number that represents one of four 3-D 
vectors: either (1, 0, 0), (0, 1, 0), (0, 0, 1), or (-1, -1, -1). A solution to the 3-D BROUWER instance 
is a cubelet vertex such that the set of 8 results obtained by running the circuit on each of the 8 cubelets 
surrounding the vertex contains each of the four vectors at least once. 

As in 1 7 1, we will construct a set of gadgets to simulate various arithmetic operators, logical operators, 
arithmetic comparisons and other operators. We then follow their framework to systematically combine 
these gadgets to simulate the input boolean circuit and to encode the geometric condition of discrete fixed 
points in the 3-D BROUWER instance. In the preference game we construct, we specify the preference 
relation of any player P by an ordered list of a subset of the players, with the last element being P, also 
referred to as the "self" strategy. When we say that a player P plays itself with weight v, we mean that P 
assigns a weight of v to strategy P. We'll engineer the payoffs such that the game is only in equilibrium if 
the weights assigned by certain players to themselves successfully echo the inputs and outputs of 8 copies 
of the circuit that surround a solution vertex of the 3-D BROUWER instance. 

For this reduction, we require the following sets of players. 

1. One player for each of the 3 coordinates (the coordinate players). If the graph is an equilibrium, 
each coordinate player plays itself with weight equal to its coordinate of the 3-D BROUWER solution 
vertex. 

2. One player for each of the bits of each of the 3 coordinates (the bit players). In order to force these 
players to correctly represent the bits, we need some additional players. Assuming we've correctly 
calculated the first i — 1 bits of coordinate x (call them xq, . . . , we can create the i*'' bit as 
follows. One player will play itself with weight pi = x — X^}=o fj- The bit player will play itself with 
weight equal to the i'^^ bit. If pi > i-, then this bit should be 1. Otherwise, it should be 0. Therefore, 
in order to properly extract the bits, we create the following four types of players. 

(a) HALF player: In any equilibrium in which a given player plays itself with weight a, the HALF 
player will play itself with weight |. 

(b) DIFF player: In any equilibrium in which two given players play themselves with weights a and 
b, the DIFF player will play itself with weight a — b. 

(c) VALUE player: In any equilibrium, the VALUE player plays itself with weight ^. This can be 
easily created by combining a player whose first preference is itself with a HALF player. 

(d) LESS player: In any equilibrium in which two given players play themselves with weights a and 
b, respectively, the LESS player plays itself with weight 1 iff a > b, and plays itself with weight 
otherwise. (Actually, the LESS player we create will be inaccurate if a and b are very close, 
which we discuss more below.) 

3. One player simulates each type of gate used in the circuit of the 3-D BROUWER instance. For this, 
we create 3 more types of players. 

(e) AND player: In any equilibrium in which two given players play themselves with weights a and 
b, the AND player will play itself with weight a A 6. 

(f) OR player: In any equilibrium in which two given players play themselves with weights a and 
b, the OR player will play itself with weight a V 6. 
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(g) NOT player: In any equilibrium in which a given player plays itself with weight a, the NOT 
player will play itself with weight -la. 

4. Finally, we need to ensure that the graph is in equilibrium if and only if all four vectors are represented 
in the results of the 8 circuits. As in [7|, we will represent the output of each circuit using 6-bits, one 
each for +x, —x, +y, — y, +z, —z. Now, the 4 possible result vectors are represented as 100000, 
001000, 000010, and 010101. We can use these circuit results with only two additional types of 
players to feed back into the original coordinate players. First, we will create an OR player for each 
of the 6 bits (over the 8 vertices), which yields a result of six 1 's if and only if this is a solution vertex. 
Therefore, an AND player for each coordinate will all return 1 if and only if this is a solution vertex; 
at least one of the coordinates will be otherwise. We can turn this around using a NOT player for 
each coordinate, so that we get all O's if and only if this is a solution vertex. Finally, we need the last 
two new player types, which we'll use to add these results back to a copy of the original coordinates 
(the result will be the original coordinate player). 

(h) COPY player: In any equilibrium in which a given player plays itself with weight a, the COPY 
player will also play itself with weight a. 

(i) SUM player: In any equilibrium in which two given players play themselves with weights a and 
b, the SUM player will play itself with weight min(a + 6, 1). 

If the coordinates represented a solution vertex to the 3-D BROUWER instance, then all the values 
we've added back in will be zero; so the coordinate players cannot do better by changing their strate- 
gies. On the other hand, if the coordinates do not form a solution vertex, then at least one of the values 
is 1, so that the coordinate player will have incentive to change strategies and play more weight on 
itself. 

We now describe how to create the new types of players (gadgets) required for the reduction. For each 
of these gadget definitions, we assume we are given a preference game such that in any equilibrium, node 
X plays itself with weight vi and node Y plays itself with weight V2- For the first three gadgets, we assume 
vi,V2 G {0, 1}. For the rest of the gadgets, we assume vi,V2 G [0, 1]. 

OR{X, Y) 

We can add a new node R = OR{X, Y) that will play itself with weight vi V f 2 in any equilibrium. Create 
a node Ri with preference list {X, Y,Ri). Let node R's preference list be i?). Now, if vi and/or V2 is 
1, then Ri will play Ri with weight 0, so R will play itself with weight 1. If both vi and V2 is 0, then Ri 
will play itself with weight 1 , so i? will play i?i with weight 1 and R with weight 0. 

NOT(X) 

We can add a new node = NOT(X) that will play itself with weight -lUi in any equilibrium. Let node 
A^'s preference list be {X, N). Clearly, will play X as much as vi and will play N with the remainder. 

AND(X, Y) 

We can add a new node A = AND{X, Y) that will play itself with weight vi A V2 in any equilibrium. 
Assemble the OR and NOT gadgets NOT(OR(NOT(X), NOT(y))). 
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SUM(X, Y) 

We can add a new node S = SUM(X, Y) that will play itself with weight max{l, vi + V2) in any equilib- 
rium. Create a node 5i with preference list {X, Y, Si). Let node S"s preference list be {Si, S). Now, clearly 
node Si will play 5i with weight max{0, 1 — vi — V2), and node S will play Si that same amount. So node 
5 will play itself with weight 1 — max{0, 1 — vi — V2)- In other words, if + t>2 > 1, then S will play 
itself with weight 1. Otherwise, S will play itself with weight 1 — 1 + t)i + = f 1 + f 2, as desired. 

DIFF(X, Y) 

We can add a new node D = DIFF(X, Y) that will play itself with weight vi — V2 if > t'2, or otherwise 
in any equilibrium. Create a node Di with preference list {X, Di). Di will play itself with weight 1 — vi. 
Now set the preference list for D to {Di,Y, D). D will play itself with weight min(0, 1 — (1 — f 1) — V2) = 
min(0, vi — V2), as desired. 

COPY(X) 

We can add a new node C = COPY(X) that will play itself with weight vi in any equilibrium. Create a 
node Ci with preference list {X, Ci). Ci will play itself with weight 1 — vi. Set the preference list for node 
C to (Ci, C). C will play Ci with weight 1 — vi, leaving weight vi on C. 

DOUBLE(X) 

We can add a new node M = DOUBLE(X) that will play itself with weight min(l, * 2) in any equilib- 
rium. Create player Mi = COPY(X) and set M as SUM(X, Mi). 

LESS(X,y) 

Given ei {<d < ei < \), We can add a new node L = LESS(X, y) to the game that in any equilibrium 
will play only itself if vi — V2 > €1, and will play Li (for a new node Li) if vi < V2. First create 
D = DIFF(X,y). Then create Mi = DOUBLE(L»). For i = 1 to - lege/, create player Mj+i = 
DOUBLE(Mj). Call the last DOUBLE player node L and the extra player for the sum player of the last 
DOUBLE player node Li. If vi < V2, the DIFF player will return 0, so player L will play the result of 
multiplying by 2 many times, or 0. lfvi — V2 > e;, player L will play the max of 1 and (wi — 1)2) *2~ '°s<s! = 

_ „2) * 1 > I = 1. 

HALF(X) 

We can add a new node H = HALF(X) that will play itself with weight t;i/2 in any equilibrium. Create a 
node Hi with preference list {X, Hi). Hi will play itself with weight 1 — vi. Then create two more nodes: 
H2 and H^. Node H2 has preference list (//i, H2). Node H^ has preference list {Hi,H, H^). Set the 
preference list for node H to be {Hi ,H2,H). Each of H, H2, and H3 will use its first choice with weight 1 — 
vi, leaving vi for its other two choices. Then, we have wh{H)+wh{H2) = vi, WH2 {H2)+wh2 {H3) = vi, 
and WHz{H'i) + wh-j,{H) = vi. In any equilibrium, it must be true that wh{H2) = wh2{H2), wh2{Hz) = 
WH3{Hz), and wh3{H) = wh{H). Solving this gives wh{H) = wh{H2) = wh2{H2) = wh2{Hz) = 

WH,{H^)=WH,{H) = f . 

As in 171, our LESS player plays the specified action (itself, in our case) with weight 1 if u 1 > V2+ei, and 
plays itself with weight if i;i < t;2> but will play some unspecified fraction on itself if t;2 < vi < V2+ei. We 
use the LESS player to extract the bits representing the coordinates of a cubelet to be passed into the circuit. 
This procedure is identical to that of [7]. Let X denote the x-coordinate player, and let Xi = COPY{X). 



10 



For i from 1 through n, we create players Bi = LESS(2~*, Xj) and Xj+i = DIFF(Xj, HALF*(i?j)), where 
HALF* indicates applying the HALF gadget i times. It can be shown that as long as x is not too close to 
a multiple of 2~", we will extract its n bits correctly. If this is not the case, however, we will not properly 
extract the bits, and our circuit simulation may return an arbitrary value. We resolve this problem using 
the same technique as in Q : we compute the circuit for a large constant number of points surrounding the 
vertex and take the average of the resulting vectors. Since these details are almost identical to that of Q 
Lemma 4], we omit them. From this reduction, we get: 

Theorem 3. It is PPAD-hard to find an equilibrium in a given preference game. □ 

3.4 Approximate equilibria 

Given the hardness of finding exact equilibria in preference games (and fractional BGP and BBC games), 
a natural next question is whether it is easier to find approximate equilibria. We define an e-equilibrium of a 
A; -player preference game to be a set of weight distributions wi, .. .,Wk that satisfy the following conditions 
for every player i: (a) Ylij'^iij) — 1' (b) f^J" ^^^^ i> Wi{j) < Wj{j) + e; and (c) for each j, either 
"^tiyj > 1 — e or \ wi{j) — Wj{j)\ < e. In other words, the weight assigned by a player i on another 
player j is at most e more than the weight assigned by j on itself; and for any i and j, either i plays a total 
weight of at least 1 — e on players it prefers at least as much as j or the weight assigned by i on j differs 
from that assigned by j to itself by at most e. Note that there exists some threshold preference such that any 
player preferred strictly more than that must be "filled" to within e of the allowed weight. The rest of at least 
1 — e weight must be placed on players at the threshold preference. At most e weight is left for players with 
preference lower than the threshold. 

Two notions of approximation have been defined for fractional BGP: an e-solution by lITSll and e-stable 
solution by ll20ll . The (polynomial-time) reduction of Lemma [T] mapping a given preference game instance 
P to a fractional BGP game instance B has the property that any e-solution or e-stable solution for B is, in 
fact, an e-equilibrium for P. This implies that any PPAD-hardness on finding e-equilibrium for preference 
games immediately yields an equivalent result for both notions of approximation for fractional BGP. 

Theorem 4. It is PPAD-hard to find an e-equilibrium for preference games, for some e inverse polynomial 
in n. 

Proof. Our proof follows the framework of f5^,'6l for proving the hardness of approximating Nash equilibria 
in 2-player games. This framework starts with a high-dimensional discrete fixed point problem, BROUWER, 
which is also PPAD-complete. The input to the problem is a Boolean circuit that assigns a color from 
{1, n, n + 1} to each interior node of an n-dimensional grid {0, 1, 8}". This grid has about 2^" cells, 
each of which is an n-dimensional hypercube. The discrete fixed point is defined to be a panchromatic 
simplex inside a hypercube. This framework of lH |6l uses a new geometric condition for discrete fixed 
points, which requires that the average of sampled points in the interior of the targeted panchromatic 
simplex is inverse-polynomially close to the zero vector. The rest of the proof follows the framework of Q. 

Our broad definition of an e-equilibrium poses additional technical challenges which did not occur in 
the reductions of [5^, 6 1 . In particular, in the presence of errors, our Boolean gadgets only approximately 
simulate the Boolean operations, while in previous reductions, the Boolean gadgets are precise. Therefore, 
most of our technical effort is to prevent the magnification of errors in Boolean simulation. In our proof, we 
have designed a CORRECTION gadget to accomplish this. 

We focus on the necessary changes for the gadgets of Theorem [3] to account for errors, and the descrip- 
tion and use of the new CORRECTION gadget. Other details closely match those of |l5l|6l|7l. 
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Let ei (the measure of the fragiUty of our LESS gadget) be a real number such that e < ef. Then, we 
have the following error bounds. 

Lemma 3. Assuming node X plays itself with weight v'l, vi — 2ei <v'i<vi + 2ei, and node Y plays itself 
with weight f2 — 2e/ < V2 < V2 + 2e/, each of the boolean gadgets defined in the proof of Theorem |j] 
plays itself within ±(46; + 6e) of the correct value for the correct vi and V2 inputs. 

Proof OR 

If vi and/or V2 is I, then v'l and/or V2 is at least 1 — 2ei, and node Ri will play Ri with weight at most 
2e/ + e, so ii will play R with weight at least 1 — 2e; — 2e. If both vi and V2 are 0, then v'l and v'2 are at 
most 2ei, and node Ri will play Ri with weight at least 1 — 4e/ — 2e, so R will play R with weight at most 
4ei + 3e. 

NOT 

If vi = 1, v'l is at least 1 — 2e/, and node N will play itself with weight at most 2e/ + e. If vi = 0, v'l is at 
most 2ei, and node will play with weight at least 1 — 2ei — e. 

AND 

The AND gadget concatenates other new players to get -'{-'Vi V -'V2)- Each NOT may add at most one 
additional e error to the given value, and the OR may add up to 3e error (on top of the sum of the errors 
from both inputs). So the AND player will return a value within an additive 4ei + 6e of the correct or 1 
answer. □ 

Lemma 4. Each of the arithmetic gadgets plays itself within ib5e of the correct value for the input it is 
given. 

Proof SUM 

Node Si will play Si with weight w{SiT) £ [max{0, 1 - v[ - V2 - 2e),max(0, 1 - v[ - V2 + 2e)]. So 
node S will play S with weight ws{S) G [v[ + f 2 — 3e, v[ + f 2 + 3e], unless wsi (Si) = 0, which means 
v[ + V2> I — 2e. In this case, node S will play S with weight at least 1 — e. 

DIFF 

Node Di will play DiT with weight {Di) G maa;(0, 1 — v'l — e),max{0, 1 — v'^ + e)]. Node D will 
play D with weight wd{D) G [max(0, v'l — f 2 — 3e), ma3;(0, v'^ — ^2 + 3e)], unless wdi{Di) = which 
means v'i>l — e. In this case, node D will play D with weight at least 1 — — 2e and at most 1 — f 2 + e 
(not 2e because we cannot underfill the strategy with weight 0). 

COPY 

Node Ci will play Ci with weight at least 1 — v'l — e and at most 1 — v'^ + e. Node C will play C with 
weight at least v'^ — 2e and at most v'^ + 2e. 

HALF 

Node iTi will play Hi with weight ru^/j {Hi) E [1 — f — e, 1 — w'^^ + e], and each other player will play its 
second and third preferences with total weight between 1 — {Hi) — e and 1 — whi {Hi) + e. Each other 
player will play itself half of this amount plus or minus 3e (this is easy to verify by writing the system of 
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v' 

inequalities and checking the extreme points). Therefore, node H plays H with weight at least — 4e and 

v' 

at most Y + 4e- 
DOUBLE 

The DOUBLE gadget consists of a copy player, which adds at most 2e error, and a sum player, which adds 
at most 3e error on top of the sum of the errors in the two inputs. Therefore, node M plays M with weight 
at least 2v[ — 5e and at most 2v'i + 5e. □ 

Lemma 5. The LESS player will play itself with weight < ei if it is given v[,v'2 such that v[ < v'2, and with 
weight > 1 — ei if v'^ — f 2 > 

Proof LESS 

The LESS gadget inherits its susceptibility to error from its initial DIFF player (which was, in the exact 
equilibrium case, non-zero if and only if vi < U2). For the case where vi < V2, we can account for the 
errors of the DOUBLE players (used to repeatedly amplify the difference) simply by adding extra iterations 
of DOUBLE. Since we stipulated that e < ef , a value that started < 5e will remain < ei, even after doubling 
enough times to push a value > e/ to a value over 1 (including extra multiplications to account for the 
DOUBLE errors). Therefore, the LESS player will play itself with weight less than e; if v[ < v'2 and with 
weight greater than 1 — e; if v[ — f 2 > e^. □ 

Next, we generate another gadget that can be used to amphfy the results of each boolean logic player 
before using it, in order to ensure that each input within the circuit is close to the correct value. 



CORRECTION 

After a single gate (if the inputs are within additive 2€i of the correct or 1 inputs), a player will play itself 
at least 1 — 4ei — 6e if the correct answer is 1, and at most 4e/ + 6e if the correct answer is (based on the 
analysis in the proof of Lemma |3]l. Therefore, we need only to add a LESS player to determine whether or 
not the result is < ^ and adjust the value in the correct direction using HALF or DOUBLE players. 

Lemma 6. By using a CORRECTION gadget after each boolean logic gadget, we can ensure that the output 
from each gate is at most 2ei away from the correct output. 

Proof. The results of a single gate gadget will be at least 1 — 4e/ — 6e if the correct answer is 1, and at most 
4e; + 6e if the correct answer is 0. If the result is < 5, we will add three HALF players: the first reduces 
any result (at most 5e;) to at most ^ + 4e (notice that we may add an additional 4e error from the HALF 
player), the second reduces it to at most ^ + 6e, the third to at most ^ + 7e, which is at most e/, since 
e ^ . If the result is > ^, we add a single DOUBLE player, which should give us a result of at least 1 — e 
(since the input is very close to 1, the extra player in the SUM portion of the gadget has to play 0). However, 
we do collect a small additional error term because of the LESS used in the CORRECTION player. 

We can use the LESS player as an if-statement (as needed above) as follows: LESS will play one of 
two strategies with weight close to 1, the other with weight close to 0. Say Pi is the strategy that will be 
played with weight close to 1 (> 1 — e^) if and only if f 1 < ^, P2 is the strategy played with weight close 
to 1 (> 1 — e;) if and only if vi > ^. We create the necessary players for both the HALF gadget and 
the DOUBLE gadget, but add P2 as the first choice preference for the three players in the HALF gadget 
(labeled H, H2 and i/3 in the gadget description), and add Pi as the first choice preference for the COPY 
and SUM players in the DOUBLE gadget (players C and S, but not players Ci and Si). Add one additional 
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player S\JM{H, D), where H is the HALF player and D is the DOUBLE player (one of the two is playing 
itself with weight close to 0). To show the correctness of the CORRECTION gadget, consider the following 
case analysis, assuming the result we are trying to correct is value v G {[0,5e/), (1 — bei, 1]}. Call the 
four players that make up the DOUBLE gadget Ci (the extra player for the COPY portion), C (the COPY 
player), Si (the extra player for the SUM portion), and S (the SUM player), and the four players that make 
up the HALF gadget H, Hi, H2 and H-^ (as above): 

Case I: V < 5ei. Ci will play itself with weight at least 1 — v — e > 1 — 5ei — e. C will play Pi with weight 
at least 1 — — e. It must play the rest of its weight on the heavily-weighted Ci . Si will play some 
amount on the player that has weight v and some on C, but must have at least l — 2ei — 3e left for itself. 
S will play at least weight 1 — e; — e on Pi, and must play the rest of its weight on heavily- weighted 
player Si , leaving on itself. 

Meanwhile, Hi will play at least 1 — t> — e on itself, so each of H, H2 and H^ will use up to within e 
of the weight of P2 (which may be 0), and of the weight of Hi (at least 1 — v — 2e), leaving at most 
V + 2eto be divided in half. As stated above, this remaining amount will be split to within ±3e across 
the strategies, so the result will be at most fe; + 4e. Since e is much smaller than e/, the additive e 
values with each iteration of the HALF gadget will be covered by the ei. 

The SUM player in the CORRECTION gadget will return a value at most the correct sum (< ei from 
the previous paragraph) plus 3e. 

Case 2: v > 1 — 5ei. Ci will play itself with weight at most 1 — v + e < 5ei + e. C will play Pi with weight 
at most ei + e, and will play Ci with weight at most 5ei + 2e, leaving at least 1 — 6e; + 3e on itself. 
Si will try to play at least 1 — 5e; — e on the player that has weight v on itself and at least 1 — 66; — 4e 
on C, which will leave nothing left for itself. S will play at most e/ + e on Pi, and at most e on 5i, 
leaving at least 1 — e; — 2e for itself. 

Any errors in the HALF player for this case will be if our player puts > weight on the HALF player. 
However, this will only help to inflate the final result of the CORRECTION gadget. 

The SUM player in the CORRECTION gadget will return a value at least the correct sum (> 1 — — 
2e) minus 3e, or at least 1 — ei — 5e > 1 — 2ei. 

Using this CORRECTION gadget after each gate, we keep our input values to within 2ei of the correct 
values, as required. □ 

After the corrections, we're left with the following possible errors due to the e-approximation. We have 
small errors in the bit extraction, which are no larger than the parallel errors in \J] (they verify that these 
small error values will not affect the final result). We also have small errors (at most 2e;) coming out of the 
circuit. As in miSI, we will repeat the circuit a polynomial number of times and take the average in order 
to override any errors from the LESS gadgets in the bit extraction. 

Taking an average of two results requires 3 steps: first we divide each "bit" in half (we cannot take the 
average of the entire values because we have a max value of 1 for any player, so the average of two I's 
would come out to Here, we may pick up 4e of error for each of the two results. Then, we sum the two. 
The total error so far is at most lie. Finally, we take half of the sum, which also divides the error in half, but 
may add up to an additional 4e of error, for a total additional error of at most 9.5e from taking the average 
of 2 results. 

We can add CORRECTION gadgets periodically during the averaging and during the final OR, AND 
and NOT of the results to keep our total errors under 2e;. In other words, if this is a solution vertex for 
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BROUWER, then we will have 6 players, each playing at most 2e;. If this is not a solution vertex, then at 
least one of the 6 players will play at least 1 — 2e;. Suppose we have an e-equilibrium in this game, and 
the x-coordinate player is playing value x. This is a SUM player, and the extra player from the SUM gadget 
must be playing between 1 — x — e and 1 — x + e. Therefore, the sum of the two values it is adding (a copy 
of the coordinate player and the feedback NOT player) must be between x — 3e (if this player overfills each 
of its top stretagies by e) and x + ?,e (if this player underfills each of its top strategies by e). We know that 
the copy player must be playing the same value as the coordinate player to within 2e (between x — 2e and 
x+2e). Adding this range to a number > l — 2ei cannot possibly give something in the range [x — 3e, x+3e], 
so the feedback player must be playing a value at most 2e/ on itself (since we know the feedback player will 
play either a value < 2ei or a value > 1 — 2e;), and the correct feedback must be 0, so this is a valid fixed 
point. 

□ 

Theorem |4] implies that it is PPAD-hard to find an equilibrium in both fractional BGP and fractional 
BBC games. Since it is PPAD-hard to find a fractional BGP equilibrium, it is natural to next consider special 
instances when it might be easier to find an equilibrium. For instance, in real world internet routing, BGP 
path preferences are primarily based on a combination of security considerations and shortest paths. What 
would happen if we restrict ourselves to path preferences that echo the real world? Unfortunately, using 
only small adjustments to the above hardness proof, we show that it is PPAD-hard to find an equilibrium 
even if all preferences are based only on shortest path lengths. 

Theorem 5. Fractional BGP is PPAD-hard even if each node's preference list consists of all paths, ordered 
shortest to longest based on edge length ( where each node defines its own edge lengths, which may not obey 
triangle inequality ). 

Proof. We will implicitly translate the proof of Theorem[4]to a corresponding proof for BGP, by assuming a 
destination node T, each preference by player V for a player U is now a preference for a path {V ^ U — > T) 
(abbreviated {VUT)), and each preference by player V for "self" is now a preference for path {V T) 
(abbreviated {VT)). We will add a set of edge lengths for each node in the gadgets such that the preferences 
in the gadget definitions follow shortest path distances according to the specified lengths. 

For each node U used in each of the gadgets, the preference list is of the form (UVT, UT), {UVT, 
UWT, UT), or {UVT, UWT, UZT, UT) (the last is only for the HALF player in the CORRECTION 
gadget). For preferences of the first form, we will assign edge lengths 1{UV) = 1,1{VT) = 1, and all other 
lengths are 3. Clearly, to get to T through any node other than V , the cost will be greater than 3, so the direct 
path will be preferred. The distance via V is 2, so this will be preferred over the direct path. For preferences 
of the second form, we will assign edge lengths 1{UV) = l,l{VT) = l,l{UW) = 2,1{WT) = 1, and 
all other lengths are 4. Clearly, the preferences for the 3 paths in the list will be correctly ordered based on 
distance. Any path involving a node other than VorW will have length greater than 4. Edges VW and 
WV both also have length 4, so any path to T that uses VorW (that is preferred over the direct path) cannot 
include both V and W. This leaves only the paths in the original preference list. For preferences of the third 
form, we will assign edge lengths 1{UV) = 1, 1{VT) = 1,1{UW) = 2, 1{WT) = 1, 1{UZ) = 3, 1{ZT) = 
1, and all other lengths are 5. Similar reasoning shows that this preserves the preference list. □ 

Theorem 6. Fractional BGP is PPAD-hard even if all preferred paths are preference-ordered based on the 
path length (where each node defines its own distances on the edge lengths, and these distances form a 
metric and obey triangle inequality), assuming we may only use edges from a given template graph. 
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Proof. As in the proof of [5} we will implicitly translate the proof of Theorem |4] to a corresponding proof for 
BGP, by assuming a destination node T, each preference by player V for a player U is now a preference for 
a path {VUT), and each preference by player V for "self" is now a preference for path {VT). 

We will add a set of edge lengths for each node in the gadgets such that the preferences in the gadget 
definitions follow shortest path distances according to the specified metrics. 

First, we will replace each direct path with a 2-hop path, by adding an extra node (whose only preference 
is for its own direct path). In other words, we will replace any path of the form UT with a path of the form 
UU'T. We will replace any use of a direct path, such as VUT, with a use of the modified path: VUU'T. We 
will remove all other edges straight to T from the template graph, and we remove all edges into a new node 
U' except the edge from U. Removing edges straight to T is necessary because a preference list {VUT, VT) 
does not obey triangle inequality for any metric. However, the list {VUU'T, VV'T) is a valid preference 
list if y uses the following edge lengths: 1{VU) = 1,1{UU') = 1,1{U'T) = 1,1{VV') = 2,1{V'T) = 2 
(assuming VT is not allowed). Removing other edges into U' is necessary because otherwise any path 
VU'T would have to be preferred at least as much as VUU'T. 

Now, for each node U used in each of the gadgets, the preference list is of the form {UVT, UT), {UVT, 
UWT, UT), or {UVT, UWT, UZT, UT). With the new additional nodes, each node now has a preference 
hst of the form {UVV'T, UU'T), {UVV'T, UWW'T, UU'T), or {UVV'T, UWW'T, UZZ'T, UT). For 
the first type of preference list, we will define the length of each leg of the most preferred path to be 1 , the 
length of each leg of the second path to be 2, and any other edge in the graph has length 3. It is easy to verify 
that these lengths obeys triangle inequality and give the required preference order. For the second type of 
preference list, we will assign edge lengths 1{UV) = 2, 1{VV') = 1, 1{V'T) = 1, 1{UW) = 2, 1{WW') = 
2, 1{W'T) = 1, 1{UU') = 3, 1{U'T) = 3. In order to ensure triangle inequality, set 1{VW) = 1{WV) = 4. 
The rest of the edges in the graph had length 5 (so any path to the root containing any node other than 
U, U', V, V, WandW has length at least 10). In the smaller graph containing only U, U' , V, V , W, W, 
the paths to the root that haven't been included in the preferences list or specifically excluded by restricting 
the edges are UVWW'T (which has length 9) and UWW'T (which has length 8) - both are longer 
than any path in the preference list. For the third type of preference list, we will assign edge lengths 
1{UV) = 3,1{VV') = 1,1{V'T) = 1,1{UW) = 3,1{WW') = 2,1{W'T) = 1,1{UZ) = 2,1{ZZ') = 
3, 1{Z'T) = 2, 1{UU') = 4, 1{U'T) = 4. In order to ensure triangle inequality, set 1{VW) = 1{WV) = 6, 
1{VZ) = 1{ZV) = 1{WZ) = 1{ZW) = 5. The rest of the edges in the graph have length 5. 

Since we've added an additional edge to every path, this construction adds up to e enor for each player 
in the proof of theorem]?] However, these errors will still be overpowered by the ei errors from our LESS 
gadget, so the proof could easily be adjusted to compensate. □ 

Notice, if any edge may be used, and if the preferences are based on shortest path lengths for a metric 
defined for each node, then there is a trivial algorithm for finding an equilibrium: each node only follows the 
"direct to destination" path. Since a metric must obey triangle inequality, this path length cannot be strictly 
longer (cannot be less preferred) than any path including additional nodes. 

4 Existence and Rational Solutions via Personalized Equilibria 

We introduce a new notion of an equilibrium for matrix games based on min-cost flows. Because the 
flow-based payoff functions enable each player to individually match her distribution to her opponents' 
distributions, we call this a personalized equilibrium. We study the structural properties of personalized 
equilibria and analyze the complexity of finding such an equilibrium. We show that both the fractional BGP 
game and the fractional BBC game are special cases of matrix games in which players seek a personalized 
best response. 
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We first define personalized equilibria for two player games. We then extend it to multi-player games, 
including multi-player games with succinct representations. Consider a matrix game {R, C) between two 
players ROW and COLUMN, in which player ROW has strategies ri, r2, . . . , and player COLUMN has 
strategies ci, C2, . . . , Cn. R G M"*^" is the payoff matrix of ROW, and C G ]K™x" is the payoff matrix of 
COLUMN. 

Like a standard bimatrix game, if player ROW selects r j and player COLUMN selects Cj , the payoff to 
ROW is R[i,j] and the payoff to COLUMN is C[i,j]. Suppose ROW selects a distribution x among the 
strategies {ri, r2, . . . , rm}, and COLUMN selects a distribution y among {ci, C2, . . . , Cn}- Unlike payoffs 
defined for mixed strategies, in which the payoff to ROW is j x[i]y[j]R[i, j] and the payoff to COLUMN 
J2i j x[''']y[j]C[i, j], we define the payoffs using flows. The payoff to ROW is: 

Payoff (ROW) = max ^ UijR[i,j] (1) 

^ij ■ ■ 

subject to Ujj = x[i], Mi and ^^Mjj=y[j], Vj; 
Payoff (COLUMN) = max V VijC[i, j] (2) 

subject to ^Vjj = Vi and ^Vi^j = y[j], Vj. 

3 i 

In other words. Payoff (ROW) is the cost of a 1-unit min-cost flow from source r to destination c in the 
directed graph Gr = {Vr, Er), with 

Vr = {r, c,ri,r2,...,rm,ci,C2,...,c„} 

Er = {{r ^ ri), \/i}U {{n ^ Cj), \/iJ}U {{cj ^ c), 

where the capacity of edge (r r^) is x[i], the capacity of edge {cj c) is y[j], and the capacity of all 
other edges is +oo. The cost of edge (rj — > Cj) is —R[i,j], and the cost of all other edges is 0. We note 
that for any distributions x and y, a unit-flow from r to c always exists, so the above payoff function is 
well-defined. 

Similarily, Payoff (COLUMN) is the cost of a 1-unit minimum-cost flow from source c to destination r 
in the directed graph Gc = {Vc, Ec), with 

Vc = {r,c,ri,r2, . . . ,rm,ci,C2, . . . ,c„} 

Ec = {{c^ Cj), \/j}U {{cj ^ u), \/i,j}U {{u ^ r), 

where the capacity of edge (c — > Cj) is y[j], the capacity of edge (r^ r) is x[i], and the capacity of all 
other edges is +oo. The cost of edge (cj —>■ ri) is —G[i, j], and the cost of all other edges is 0. 

Because there is no condition such as u[i,j] = v[i,j] in Eqn. ([T]), (or in the payoff function for COL- 
UMN) each player can individually choose the best way to match the distributions. We therefore refer to 
these payoff functions as personalized payoff functions, and we call an equilibrium for the game with these 
payoffs a personalized equilibrium. Using personalized payoffs, each player plays a distribution across her 
strategy space and chooses how to combine it with the strategy distributions of the other players. 

In addition to the fractional BGP and BBC games, this concept of personalized equilbria is inspired by 
the correlated equilibrium of Aumann (Q). Recall that the correlated payoff function requires Uij = Vij 
in Eqn. (jl]) and Eqn. but relaxes Nash's condition of mj = Vij = x[i]y[j]. We are considering payoff 
functions (personalized payoff functions) which futher relax this by removing Uij = Vij. 
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One can extend the personalized payoff functions to multi-player matrix games. Suppose we are given 
a A; -player matrix game G, with Si being a set of rm strategies for player i, \ < i < k, and Ui : W- Sj — > M 
being the utility function for player i. As in a mixed strategy, each player i chooses a probability distribution 
Pi : Si —>■ [0, 1] over the strategies in Si. Given pi, pk, the personalized payoff function for player i is 
computed as follows. Construct a hypergraph Hi with V = UjSj as the set of nodes and E = Y[j Sj as the 
set of hyperedges. Consider a (fractional hypergraph) matching defined by an assignment Wi : E —>■ of 
weights to each hyperedge that satisfies the condition that the sum of weights of all hyperedges adjacent to 
any strategy s £ Sj (for any j) equals Pj{s). Define the weight of matching Wi as X^eeE Wi{e)ui{e). The 
payoff to player i is then simply the cost of the maximum-weight matching in Hi. 

The concept of personalized equilibria is extendible to games with succinct representations such as 
graphical games [ 19 1 and multimatrix games |34|. It can also be viewed as a relaxation of correlated equi- 
librium, as mentioned above. 

Theorem 7. Finding an equilibrium in the fractional BGP game can be reduced to finding a personalized 
equilibrium in a matrix game. 

Proof. Consider any instance B of fractional BGP. We will create a matrix game, M, such that a solution to 
the M is a personalized equilibrium if and only if a corresponding solution to B is an equilibrium. 

For each node v in B, create a player v' in M. Assign v' one strategy P' for each path P S vr(t>), plus 
one strategy for "no path." Let qv{P) = the number of paths Q such that P >y Q. Next, we will define 
the payoff to v' for a hyperedge in M containing P' (for P G tt{v)). If this hyperedge contains all proper 
suffixes of P, then the payoff to v' will be qv{p) + 1. Otherwise, the payoff to v' will be 0. All hyperedges 
including the "no path" strategy for v' will have payoff for v'. 

Given a set of distributions {p^t } and a set of hyperedge weights w in M, we can assign path weights w' 
in B: w'^{P) = p^i{P'). If "no path" has any weight, this weight is not assigned in B. 

We will show that a solution in B is feasible if and only if the corresponding solution to M is feasible, 
then show the correspondence of equilibria. The unity condition is clearly preserved: the distribution for a 
node in M is a distribution of 1 unit. The weights placed on edges in B also sum to 1. Now let's consider 
the tree condition. Suppose the tree condition is violated in B. Then, there exists a path S starting at some 
node u such that for another node v, YIiP^-kv s > '^'ui^)- This means that in M, we had a strategy 

S' of node u' such that for node v' , ^v'C^) > Pu'{S), which means the solution to M was also 

infeasible. Now, suppose we have a solution to M that is infeasible. Then, there is some S' G S^' such that 
for some node v' , J2eeE-S'€e'^^'i^) > Pu'{S'). If the weight placed on S' was from a path that did not 
include s as a subpath, v' could move the excess weight from S' onto any strategy of u' without changing 
the payoff, so all remaining weight on S' much be from paths that contains S' as a suffix. But then, for B, 
we have J^pi^ttv s '^'v{P) > '^'u{S) - another infeasible solution. 

As a first step in showing that the equilibria are equivalent, we will note that fractional BGP preference 
lists across paths can be replaced with preference weights. Any weights that preserve the > relationship will 
also preserve the set of equilibria. To show this, first define preference weights Uy{P) for all paths P G it{v) 
such that Uy{P) > Uy{Q) if and only if P >^ Q. Now, suppose we have an equilibrium w using weights 
u which is not lexicographically maximal, plus a lexicographically maximal solution w' . Let V = the set of 
paths such that for all P and Q'mV, P >v Q, Q >v P, and for all Q >v P, Wv{Q) = Wy{Q). Then, we 
know that 

1- J2Q:Q>^P,Per '^^(Q) = 12q:Q>^P,P(^V 

2. Uy{P) = Uy{Q) if PeV,QeV 
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3. u^{P) > u^{Q) if PeV,Q^V. 

Replacing w with w' will keep the same weights on all paths with strictly higher preference weight that 
Uv{P),P G V, increase the weight of paths in V by some total increase amount I, and decrease the total 
weight by / of the paths with preference weight < Uy{P),P G "P, thereby improving the solution. Similarly, 
if we have an equilibria using preference weights, it must be lexicographically maximal. 

Now, we can write a linear program to find a best response for fractional BGP, In this LP, Wy is the set 
of weights assigned by node v. 

max Wv{P)uy{P) 

Wv{P) < Wu{S) S e tt{u), l<u<k 

PeTT(v,S) 

Wv{P)>0 P e Tr{v) 

We will also adjust the linear program for finding a best response using personalized payoffs to work 
for graphical games. In the graphical representation, we write e £ E to represent a hyperedge, where 
a hyperedge is a subset of at most one strategy per player (compared to exactly one strategy per player 
previously). We still use Wv{e) to mean the weight placed by player v on hyperedge e, Uv{e) to mean the 
payoff to player v for hyperedge e, and Pv{s) to mean the weight placed by player v on his own strategy s. 
The following hnear program defines a best response for player v. 

max Wy{e)uy{e) 

Wy{e) < pu{s) s e Su,l <u < k 

e:s6e 

Wyi{e) > e e E 

Now, if we assign preference weights for B: Uy{P) = Uv{e) where e is the hyperedge in M correspond- 
ing to P and all suffixes of P, the two linear programs are exactly equivalent. Therefore, the set of equihbria 
is exactly equivalent. □ 

Theorem 8. Finding an equilibrium in the fractional BBC game can be reduced to finding a personalized 
equilibrium in a matrix game. 

Proof. Consider any instance of fractional BBC. Create a player in the matrix game for each node in the 
BBC instance. Assign the player one action for each available edge in the BBC instance. For any hyperedge 
in the matrix game, a player's payoff is negative of the length of the shortest path to the destination made 
up of a subset of the edges represented by that hyperedge (or negative of the disconnection penalty if there 
is no such path to the destination). The proof that this preserves the set of equihbria is similar to the above 
proof for fractional BGP. □ 

4.1 Existence and Rational Solutions 

Theorem 9. For every multi-player matrix game, a personalized equilibrium always exists. 
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Proof. Given the matrix game G, we construct the /c-player game Q in which the i\h player's strategy space 
is the set of all probability distribution functions over Si and the payoff is given by the personalized payoff 
function defined above. Then a personalized equilibrium of G is equivalent to a Nash equilibrium of Q. 
By l[28l Proposition 20.3], a game has a pure Nash equilibrium if the strategy space of each player is a 
compact, non-empty, convex space, and the payoff function of each player is continuous on the strategy 
space of all players and quasi-concave in the strategy space of the player. The set of probability distributions 
over Si is clearly nonempty, convex, and compact. Furthermore, given probability distributions pi over Si, 
1 < i < k, the payoff for any player i is simply the solution to the following linear program with variables 
Wi{e), over e e E. 

max Wi{e)ui{e) 

Wiie) = pj{s) s £ Sj,l < j < k 
Wi{e) > ee E 

It is easy to see that the payoff function is both continuous in the probability distributions of all players, and 
quasi-concave in the strategy space of player i, thus completing the proof of the theorem. □ 

Theorem 10. For any matrix game with all rational payoffs, there exists a personalized equilibrium in which 
the probability assigned by each player to each strategy is a rational number. 

Proof. Let G be a A: -player matrix game. (Please refer to the beginning of Section|4]for relevant notation.) 
For each player i, let pi : Si ^ [0, 1] denote a probability distribution over its strategies. lfp= {pi, . . . ,pk) 
forms a personalized equilibrium, then it provides a feasible solution to the following linear program over 
variables Wi{e), where e G Hj 'S'j and 1 < i < k, and Pi{s), where 1 <i < k and s G Sf. 

Wi{e) = pj{s) s £ Sj,l < j < k,l < i < k 

e:see 

^Ms) =1 l<i<k (3) 

Wi{e) >0 l<i<k,ee E 

Furthermore, if p is a personalized equilibrium, then each {pi,Wi) pair maximizes '^eWi{e)ui{e) subject 
to LP ([3]). Suppose p is not a personalized equilibrium, yet satisfies LP Q. This is so if and only if there 
exists a player £ for which {pg, wg) does not maximize J2e we{e)ue{e). Suppose {p'^, w'^) with w'^ 7^ wi is 
an optimal choice for player £. Then, 6 = w'^ — wgis a. feasible solution to the following LP: 

^5(e)n,(e) >0 

ee-B 

J^,5(e) =0 seS„l<j<k (4) 
(5(e) > -W£{e) e£ E 

If F is the set of hyperedges for which 6{e) is negative, then 6{e) satisfies LP ^ only if W£{e) > for all 
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those hyperedges. This motivates replacing the last constraint of (|4]) with these two new constraints: 

Y,5{e)ut{e)>{) 

^(5(e) = seSj,l<3<k (5) 

e-.sde 

This LP, which we refer to as LP ([5]) is independent of w^, for each player £ and F (1 E. 

We have thus argued that p is a personalized equilibrium if and only if there exists w = {wi, . . . ,Wk) 
such that p and w satisfy LP ^ and, if LP ^ is feasible for some i and F, then W£{e) should not be positive 
for all e in F. We thus add the following constraints to LP Q: 

min w^(e) = 0, for alH and F such that LP Q is feasible. 

By taking all combinations of one hyperedge from each of the above product constraints, we get an expo- 
nential number of linear programs (with all rational coefficients), the union of which precisely describes all 
personalized equilibria. By Theorem [9} at least one of these linear programs is feasible, which implies that 
there exists a personalized equilibrium with all rational probabilities. □ 

4.2 Complexity of finding Personalized Equilibria 
4.2.1 Two Player Personalized Equilibria 

It is not hard to show that the set of all two-player personalized equilibria is convex. In fact, we can give a 
stronger characterization, which will lead to a polynomial time algorithm. 

Theorem 11. A 2-player personalized equilibrium can always be found in polynomial time. 

Proof. Recall the secondary definition of the personalized payoff to player ROW in a two-player game given 
at the start of Section |4j 

Payoff (ROW) is the cost of a 1-unit minimun-cost flow from source r to destination c in the directed 
graph Gr = {Vr, Er), with 

Vr = {r, c,ri,r2,...,rm,ci,C2,...,c„} 

Er = {(r ^ri), Vi}U{(ri ^9), Vi,j}U{(cj ^c), Vj}, 

where the capacity of edge (r r-i) is x[i], the capacity of edge {cj —>■ c) is y[j], and the capacity of all 
other edges is +oo. The cost of edge (rj —>■ cj) is —R[i,j], and the cost of all other edges is 0. 

A similar definition of a flow on a graph Gc gives the payoff function for player COLUMN. 

Now, let graph G = the union of Gr and Gc- We will now consider a subgraph G' = {V, E') C G, 
such that V" = Vr n Vc, {ri cj) G Er is in E' if and only if R[i,j] > R[i',j] for all i', and (cj rt) E 
Ec is in E' if and only if C[i, j] > C[i, f] for aU f. 

Any directed cycle in G' corresponds to a personalized equilibria. Consider any cycle 
{^ii, Cji,ri2, Cj2, . . • , rii,Cii} in G', each node played with weight j. Player ROW can match each of his 
strategies with player COLUMN'S strategy cj^. Since this is a best response for player ROW, ROW 
cannot do better by changing to another strategy. Similarly, player ROW can match each of his strategies 
Cjfc with player ROW's strategy rj(;j_|_i) for k < I, cji can be matched with rn. 

Every personalized equilibria is a linear combination of cycles in G'. Starting with any bipartite graph 
from G' in which the in-degree equals the out-degree of each node (a characteristic of any personalized 
equilibria), we can remove any cycle (which is a personalized equilibria) and we are still left with a bipartite 
graph with the same characteristic. □ 
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4,2,2 Multi-player personalized equilibria 

Theorem 12, For multiplayer games, the set of all personalized equilibria may not be convex. 



Proof. Consider the following example, with 3 players, 2 strategies per player. Player i has strategies ai 
and hi. Let Pi(ai, 02, 03) = the payoff to player 1 for hyperedge {ai, 02, 03}. The payoffs to player 1 are: 
Pi(ai, 02,03) = Pi(ai,62,63) = 1, ^1(61,02,63) = ^1(62, 62, 03) = 2, the other 4 payoffs for player 1 
are all 0. The payoffs for the other players are 1 for all hyperedges. In this example, the pure strategies 
01,02,03 and 01,62,63 are both equilibria. However, a combination of these two, oi = 1,02 = 03 = 
^, 62 = 63 = (1 — A), is not an equilibrium, since player p would prefer to play hyperedges {61, 02, 63} and 
{61,62,03}. □ 

We have shown that the preference game is a special case of fractional BGP, and we have shown that 
fractional BPG is a special case of finding personalized equilibria. Therefore, the non-convex example from 



Section 3.2 also proves theorem 12 



Theorem 13, It it PPAD-hard to find a personalized equilibria in general matrix games. 

Proof. Again, this is shown via the reduction from preference games to fractional BGP to personalized 
equilibria. □ 

Theorem 14, It is PPAD-hard to find 4-player personalized equilibria. 

Proof. For this theorem, we first note that when reducing from preference games to fractional BGP to 
personalized equilibria, we keep the same number of players. We also preserve the number of players 
"depended on" for the payoff of a particular strategy. In preference games, a payoff depends only on the 
single other player being chosen, and a player will only place weight on other players it prefers over itself. 
When reduced to fractional BGP, the paths considered by a node are only one or two-hop paths: and the node 
only considers two-hop paths that it prefers over its "direct" path. When reduced to finding personalized 
equilibria in a matrix game, we keep the same number of players. The payoff for a hyperedge depends only 
on a number of players equal to the number of hops in the path represented by that hyperedge; in this case, 
at most 2. The only strategies that will ever be chosen by a node are the strategies corresponding to players 
prefered over that node in the original instance of thepreference game. 

Therefore, if we start with the reduction from 3-DIMENSIONAL BROUWER used in Section |331 we 
have a graphical matrix game in which each node's strategy "depends on" one or two other nodes, and each 
node "influences" the strategy for one or two other nodes. (We can easily make this a max of 2, because if 
a node influences 3 other nodes, we just switch to influencing one of them plus a copy gadget, which can 
influence the other(s). So we can say that in-i-out degree is at most 4. Each node has 2 strategies. 

We want to represent this as a 4 player game with more strategies per player. 

First we have to slightly transform the graph in order to get two properties: 

1 . Our graph should have max degree of 3 (in -1- out) 

2. If a node "depends on" 2 other nodes, we want an edge (undirected is fine) between these two nodes. 
This edge counts in the degree. 

We have a degree 4 graph and we want to change it to degree 3. Any degree 4 node currently has 2 
inputs plus 2 outputs. We can change both of the outputs into a single copy gadget with 2 outputs - the copy 
has one input and 2 outputs, or degree three. 
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Now we have a graph with max degree 3, but we need the edges from property 2 (an edge between any 
two nodes X and Y that influence the same third node, Z). Suppose we have two nodes X and Y that both 
influence a third node Z, and node X has degree 3 already. Just add a new node X' that copies X, and make 
this copy influence Z. Now X still has degree 3, X' has degree 2 + the edge between X' and Y. 

After the above conversions, we have a graph with those 2 properties. Create a 3-coloring (possible 
because max degree is 3). Create one player per color. Each player has 2 strategies for each node in that 
color (one for the strategy of that node, one for the 1 strategy). 

Add dummy strategies as necessary so that each of the 3 players has the same number of strategies. Also 
add a fourth player with half the number of strategies as any other player. 

This gives us 4 players. Let the strategies for player 1 be {aio, an, 020, 021, • • • , a^.o, flfei}. The strate- 
gies for player 2 are {feio, 611, ... , hko, hki). The strategies for player 3 are {cio, cn, . . . , Cfco, Cki}- The 
strategies for player 4 are {di, ^2, . . . , djt}. 

Next we will assign payoffs for each hyperedge. Start by giving each hyperedge the same payoff as in 
the graphical game (we can do this because no two nodes influencing the same strategy are strategies of the 
same player). Notice that these payoffs will not depend at all on player 4. All of player 4's payoffs start at 
0. Let Pi{w, X, y, z) = the payoff to player i if player 1 plays w, 2 plays x, player 3 plays y, player 4 plays 
z. Now we want to add to these payoffs in order to ensure that each player plays each strategy pair equally. 

Let M > the largest payoff so far. Now, change the following payoffs: 

Pi{asi, X, y, ds)+ = M (player 1 is playing either strategy from the node numbered s, player 4 is play- 
ing his s*'* strategy). 

P2{w, bsi, y, ds)+ = M (player 2 is playing either strategy from the node numbered s, player 4 is playing 
his s^^ strategy). 

P3{w,x,Csi,ds)+ = M (player 3 is playing either strategy from the node numbered s, player 4 is playing 
his s*^ strategy). 

Pi{asi, X, y, d(^g^i-))+ = M (player 1 is playing either strategy from the node numbered s, player 4 is play- 
ing his s -I- strategy). 

If fi{x) = the amount player i plays strategy x then in any equilibrium we must have (for all s) 

fi{aso) + fi{asi) = fiids) 

f2{bso) + f2{bsl) = U{ds) 

h{cso) + h{Csl) = fiids) 

fiids) = /i(a(,._i)o) + /i(a(s-i)i)for s > 

f4{do) = /i(aiko) + /i(afci) 

These equations imply that: 

/i(aso) + /i(asi) = /i(a(5-i)o) + /i(a(s-i)i)for s > 

/i(aoo) + /i(aoi) = /i(afeo) + /i(afei)for s > 

f2{bso) + f2{bsi) = /i(aso) + /i(asi) 

h{cso) + faicsi) = /i(aso) + /iKi) 

In other words, given an equilibrium in this game, we can simply multiply by the number of pairs (nodes) 
per player to get an equilibrium in the graphical game. □ 
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With a slight modification, the above proof also establishes PPAD-hardness for approximating person- 
alized equilbria in 5-person games. We simply need to note that in the approximation gadgets created for 
the preference game hardness proof, the maximum length of any preference list is 4 instead of 3 (one of the 
players in the CORRECTION gadget "depends on" 3 other players). The rest of the proof remains intact. 

5 Concluding Remarks 

We note that our PPAD-hardness results from section [3] also apply to two other problems reduced in 
1X61 to fractional BGR The first of these problems is finding fractional stable matchings in hypergraphic 
preference systems. The second is finding fractional kernels in directed graphs. 

We raise a number of open questions. 

• Is finding a personalized equilibrium in general matrix games in PPAD? Although we show PPAD- 
hardness for general games, we have not settled the question of PPAD-membership. We have shown 
that a rational solution always exists, so finding an exact equilibrium may be in PPAD. 

• We show that it is possible to find a personalized equilibrium for a 2-person game in polynomial time, 
and it is PPAD-hard to find a personalized equilibrium in a 4-person game. However, the hardness of 
finding these equilibria in 3-person games remains open. 

• In this paper, we concentrate on a version of BBC games in which all nodes want to reach a single 
universal destination. However, in [22J, the utility of a node in a BBC game is defined as an affinity- 
weighted average of the shortest path length (or minimum cost flow in the fractional case) to all other 
nodes. They show that an equilibrium always exists with multiple destinations, and our hardness 
results of course extend to this model, since we can define only one non-zero affinity, but it is unknown 
whether rational equilibria always exist. 

• Our reduction from preference games to BBC games does not apply in the "multiple destinations" 
model if all affinities must be equal. Is this special instance of BBC games PPAD-hard as well? 

Personalized equilibria and preference games both have a number of real world and theoretical appli- 
cations, and seem to be natural end points in a spectrum of "personalized" fractional games. As more 
games are added to this hierarchy, we hope to fully understand the relationships and behavior of fractional 
equilibria. 
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